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ON LOGARITHMICALLY BENFORD SEQUENCES 


EVAN CHEN, PETER S. PARK, AND ASHVIN A. SWAMINATHAN 

Abstract. Let I C N be an infinite subset, and let {ai}i £ x be a sequence of nonzero real 
numbers indexed by X such that there exist positive constants m, C\ for which \a,i\ < C\-i m 
for all i £ X. Furthermore, let Ci £ [—1,1] be defined by C; = for each i £ X, 

and suppose the Cj’s are equidistributed in [—1,1] with respect to a continuous, symmetric 
probability measure /i. In this paper, we show that if X C N is not too sparse, then the 
sequence {cn}i£x fails to obey Benford’s Law with respect to arithmetic density in any 
sufficiently large base, and in fact in any base when y([0,t\) is a strictly convex function of 
t £ (0,1). Nonetheless, we also provide conditions on the density of I C N under which the 
sequence {aj} ie x satisfies Benford’s Law with respect to logarithmic density in every base. 

As an application, we apply our general result to study Benford’s Law-type behavior in 
the leading digits of Frobenius traces of newforms of positive, even weight. Our methods of 
proof build on the work of Jameson, Thorner, and Ye, who studied the particular case of 
newforms without complex multiplication. 


1. Introduction 

It was first noted in 1881 by astronomer Simon Newcomb that when numbers occur in the 
real world, their leading digits tend not to be uniformly distributed. Specifically, Newcomb 
observed while studying tables of logarithms that certain pages were more worn away than 
others, especially those pages corresponding to logarithms whose first digit is 1 du. In 
1938, physicist Frank Benford corroborated this hypothesis in a considerably more general 
setting by testing it on an extensive data set including population sizes, physical constants, 
molecular weights, and even the surface areas of rivers [2], This bias toward certain initial 
digits, which is known as Benford’s Law , has since been discovered to hold for a number of 
distributions that arise in modern mathematics (see HU for an informative exposition on 
the subject). But before we discuss specific examples of sequences that obey Benford’s Law, 
we must pause to state the law in a precise and general manner. 


1.1. Definitions. Let N denote the set of positive integers, and let X C N be an infinite sub¬ 
set. Given an infinite sequence a = {aj}j 6 i of nonzero real numbers indexed by X and a subset 
A C M, we can associate to the pair (a, A) an arithmetic density d(a, A) that is given by 


( 1 . 1 ) 


d(a, A) 


, ffU < x : i G X and a* G A] 

hm -—--- 

z-kx) < x : i E X} 


if the limit exists. The above definition serves as a means of quantifying the density of 
elements of the set A in the sequence a = {ai} ie z- We say that a is ( arithmetically) Benford 
in base b if for any (nonzero) string of base-6 digits Sb, we have that 


d(a, 6, Sb) '■= d(a,{ieR: |x| begins with Sb in base 6}) = log b (l + S b 1 ), 
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where in computing the logarithm, we interpret Sf, as an integer. For example, if a is Benford 
in base 10, then the terms a* start with the digit 1 about d(a, 10,1) ~ 30% of the time and 
with the digit 9 about d(a, 10, 9) ~ 4.5% of the time. 

A number of sequences of arithmetic interest, such as the sequence of factorials and the 
partition function, are Benford in any base b > 2. Benford’s law has also been proven for the 
distribution of values taken by L- functions [7], data from dynamical systems (e.g. linearly- 
dominated systems and nonautonomous dynamical systems) [3], and truncated progressions 
of the 3a; + 1 problem m- 

Nonetheless, there are many natural sequences of numbers that do not satisfy this strong 
property - most notably the set of positive integers N, which fails to be Benford in any base 
b > 2. However, it is possible to show that such sequences still demonstrate a Benford- 
type behavior, as long as we consider a different, more inclusive notion of density. Given a 
sequence a = {aj} ie j and A as before, and letting T< x — {i e I : i < a;}, we can associate 
to the pair (a, A) a logarithmic density <5(a, A) defined as 


( 1 . 2 ) 


S(a, A) 



lim 

£—>•00 


ieTc x 







if the limit exists. We then say that the sequence a is logarithmically Benford in base b if 
for any (nonzero) string of base-6 digits S% we have that 

5(a,b,Sb) '■= <5 (a, {x £ K. : |x| begins with Sb in base b }) = log 6 (l + Sf 1 ). 

It is known (e.g., see [10]) that if the arithmetic density d(a,A) exists, then the logarithmic 
density S(a, A) also exists and equals d(a, A). However, the converse of this statement is 
false; as stated in [4], both the sequence of natural numbers and that of prime numbers are 
logarithmically Benford with respect to any string S), in any base b. Therefore, the condition 
of being logarithmically Benford is strictly weaker than that of being arithmetically Benford. 


Remark. There are other types density, such as Dirichlet density, that are strictly weaker 
than arithmetic density (in fact, Dirichlet density is strictly weaker than logarithmic density). 
In this paper, we restrict our consideration to arithmetic and logarithmic densities, as these 
are the most commonly studied. It may however be interesting to determine whether there 
are sequences of mathematical importance that satisfy Benford’s Law with respect to other 
types of density. For example, it is known that the primes are Benford with respect to 
logarithmic (and hence Dirichlet) density. See [9] for a more description of generalized 
asymptotic densities, and see [14] for a discussion of Dirichlet density in particular. 

1.2. Statement of Results. One interesting occurrence of the logarithmic Benford prop¬ 
erty lies in the study of Fourier coefficients of certain modular forms, namely newforms 
(i.e., holomorphic cuspidal normalized Hecke eigenforms; see |T2J for a standard reference). 
In [6], Jameson, Thorner, and Ye employed the Sato-Tate conjecture to show that for a 
newform / of even weight without complex multiplication, the sequence (a/(p) : p prime} of 
Frobenius traces of / is not arithmetically Benford in any base b > 2, but is logarithmically 
Benford in every base. As we show in this paper, the method of proof employed in [6] can be 
modified to yield more general results about when sequences are logarithmically Benford but 
not arithmetically Benford. Moreover, we prove as a corollary that the theorems of Jameson, 
Thorner, and Ye hold for such newforms with complex multiplication as well. 
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We shall consider sequences a = {a,} e x whose growth is bounded by a power function of 
the index i\ specifically, suppose there exist constants m, C\ > 0 such that |a,| < C\ -i m for all 
iel. As noted in [I], proving that a sequence is Benford usually requires an equidistribution 
result of some sort as input, so we impose the following assumption on our sequence a: taking 
q G [—1,1] to be defined by ry = c °f\ m for each i el, suppose the cfs are equidistributed 
in [—1,1] with respect to a continuous, symmetric probability measure /i. Note that this 
assumption implies that for any [A, B] C [—1,1] we have 


(1.3) 


#{l<^ e la n dc,g[VU} = (1 + 0(1)) . „ ([A B]) , 

ff{i < x : i G 1) 


from which one readily deduces that 


(1.4) Y, I = (1+ «(!))■ MIA B])' £ 

Ci&[A,B\ 


We wish to determine whether the sequence a is arithmetically, or at least logarithmically, 
Benford in any base b > 2. To this end, we prove two main theorems, the first of which is 
stated as follows. 


Theorem 1.1. Retain the above setting, and suppose for every c > 1 that in [x, ex] ^ 0 for 
all sufficiently large x. Then a is not arithmetically Benford in any sufficiently large base. 
If in addition /i([0, t]) is a strictly convex function of t G (0,1), then a is not arithmetically 
Benford in any base b > 2. 

On the other hand, our second theorem indicates conditions under which the sequence a 
is logarithmically Benford in every base b > 2. 

Theorem 1.2. Retain the above setting, and suppose 1 is such that ^2 iGX< \ = (l + o(l))- 
C 2 -g(x), where C 2 > 0 is a constant and where we may take g(x) = logo: or g(x) = log log x. 
Then a is logarithmically Benford in every base b > 2. 

Remark. We note that the assumptions made in stating the above theorems are reasonable. 
Indeed, as shown in [T], certain sequences with more than polynomial growth are known to 
be arithmetically Benford. Furthermore, the proof of Theorem 11.21 depends heavily on the 
particular properties of the functions g(x) = logx and g(x) = log log x. 

The rest of this paper is organized as follows. Section [2] presents the proof of Theorem ll.il 
and Section [3] discusses the proof of Theorem 11.21 Section [4] concludes the paper with an 
application of Theorems II. II and ll.2l to studying Benford’s Law-type behavior in the sequence 
of Frobenius traces of a newform. 


2. Proof of Theorem 11.11 

To begin with, we observe that the proof of Theorem 1 in [6] works, mutatis mutandis , 
to prove that a is not arithmetically Benford in any sufficiently large base. Indeed, the only 
conditions that the proof of [6] ever uses are that 1 have nontrivial intersection with every 
sufficiently large subinterval of N and that p, be symmetric and continuous. However, as we 
will now show, the case where /i([0,t]) is strictly convex has a much cleaner proof. 
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Suppose b > 3, and let 1& be the string of digits whose only character is the digit 1, 
interpreted in the base b. We will prove that when Sb = lb, the limit (II. ip defining the 
arithmetic density 

d(a,b, l b ) 

does not exist, which is enough to imply the theorem in this case. Assume for the sake of 
contradiction that this limit exists, and for every nSN, put 

40, 


iel; — b n < C x ■ i m < 2 b r 
I 23 

Observe that I~ ^ 0 for sufficiently large n because of our assumption on the sparseness of 
IcN. Suppose that for some * 6 /“ there exists nonnegative integer j such that 


23 . . , 

—b 3 < cd < b 
40 “ 1 1 “ 


-3 


Then, we have by the dehnition of c t that 

b n ~ j < | t^l < 2 b n ~ j , 

from which we readily deduce that |oj| begins with the digit 1 in base b. Now, by appealing 
to Theorem 14.11 we have the following lower bound on the desired density: 

d(a,b,l b ) = begin, with 1,} 


n —^oo 
oo 


> 


( 2 . 1 ) 


lim 

' ^ n —>cx 
3 = 0 
oo 


# 4 “ 

#{* G In ■ f 0 b~ j < \ c i\ < b~ j } 


#4 


3=0 


^4-j U-3 
40 ’ 


We wish to derive an upper bound on d(a, b, 1^) in a similar manner. Consider indices i such 
that b n < | a* | < 2 b n , and deffire the interval 

/+ := u el: -b n <C l -i m < -If 
P 2 3 

If |<4 begins with the digit 1, and i G /+, we deduce that | b~ 3 < |q| < for some integer 
j ; but as b > 3 and \ci\ < 1 this only makes sense for j > 0. Thus we obtain an upper bound 


d(a,b, lb) = lim 


#{* G : di begins with It,} 


n—>oo 
oo 


( 2 . 2 ) 


< 


2 5> 

3=0 


# 4 + 

-b~ j , -b~ j 
8 ’ 5 


Combining (12.ip and (12.2p . we get 


3=0 


—b~ j ,b~ j 
40 ’ 


OO 

A 

3=0 




-b~ j ,-b~ j 
8 ’ 5 
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but this contradicts our convexity assumption; we have for all x G (0,1) that 




23 

40*’* 


> [i 


3 4 
8 *’ 5 * 


For b = 2, a similar argument can be employed, but we cannot simply take the string If, 
since d(a, 1, lb) trivially equals 1 = log 2 (l + 1 _1 ). Instead, we repeat the above argument 
using the string 10b; the lower bound is obtained by counting i G X with 

^ ^ A-j |„.| A-3 

15' “ 1 “ 

across the interval ■ 4" < C\ ■ i m < 3 • 4 n , whilst the upper bound is obtained by counting 
i G I with 

2 2 
- - 4< led < - - 4“ J 
5 “ 1 11 - 3 

across the interval § • 4 n < C\ ■ i m < 5 • 4 n . We can then obtain a similar contradiction by 
using the convexity assumption on /x to see that // ([41a;, x]) > /x ([|x, |x]) for all x G (0,1). 


3. Proof of Theorem 11.21 

Recall the assumptions of Theorem 11.21 we take our measure /x to be symmetric and our 
index set X to satisfy JT gX< 7 = (1 + o(l)) • C 2 • g(x), where C 2 > 0 is a constant and where 
g(x) = logo; or g(x) = log log x. To show that a is logarithmically Benford in any base b > 2, 
it suffices to show that 

(3.1) \ = ( 1 + °( 1 )) + S b X ) ■ ( C 2 -9(x)), 

iEX< x 

CLieA(b,S b ) 

where A (6, Sb ) = {x G M : |x| begins with Sb in base b}. Let r G N, to be specified later, and 
split the sum on the left-hand-side of (13.11) into two pieces, according as |c;| < 4 or |cj| > 4 : 

< 3 - 2 > £ 7 = £ 7 + £ 7 - 

^AX< x . i£zX< x (W< 7 . 

ai£A(b,Sb) a,i£A(b,Sb) a,i£A(b,Sb) 

|ci|<l/r |cf|>l/r 

It is fairly straightforward to bound the first sum on the right-hand-side of (13.21) : 

(3.3) E -S E 4 = (l + o(l))- 2 M[ 0 .r- 1 ])-(C 2 -jW). 

, 'll % 

ai£A(b,Sb ) |ci|<l/r 

|ci|<l/r 

Estimating the second sum on the right-hand-side of (13.2 j) is certainly more involved; the 
following Lemma 13.11 shows how this can be done by making explicit use of our assumption 
that g{x) = logic or g{x) = log log x. 
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Lemma 3.1. For an initial string Sb of digits in a given base b > 2 and an integer r > 2, 
we have that as x —> oo, 


(1 + o(l)) • (log 6 (l + S b x ) - log 6 (l + r x )) ■ (C 2 ■ g(x)) 



<HeA(b,S b ) 

\ci\>l/r 


< (1 + o(l)) • (log b (l + S b x ) + log 6 (l + r x )) • (C 2 ■ g(x)) + K ■ g(r), 


for some constant K >0, possibly depending on the fixed parameters b, Sb, m, and C\. 


Proof. In what follows we shall give a proof of the upper bound; we omit the proof of the 
lower bound because it is analogous. Upon observing that we may write 

OO 

A(b, S b )= 1J {x G R : |x| G [S b ■ F, (S b + 1) • F)}, 

t= — OO 


we can split the desired sum as follows: 

i °° _ i 

E t=E E 

^— OO 

ai eA(b,S b ) S 6 -& t <|a i |<(S6+l)-& t 

\ci\>l/r \ci\>l/r 

When t < 0 and \ci\ > 1/r, the condition that S& • F < |a*| = C\ ■ i rn ■ |q| < (Sb + 1) ■ F 

, so the terms with t < 0 in (13. 4 p can be bounded as follows: 


E E -< K -aM 

i&Z< x iOZ 

S(,-6 t <|ai|<(Si,+l)-6 t f q gfeT 1 A m 

\ci\>l/r ~ V c r b J 

for some constant K > 0, possibly depending on the fixed parameters m, b, Sb, and C\. We 
may now restrict our attention to the terms with t > 0 in (13.4(1 : to bound these terms, we 
split the sum even further: 



implies that i < ( ) 


OO 



t =0 i€Z< x 

5 6 -b t <|a i |<(S il +l)-6 t 

|ci|>l/r 


1 

i 


r 2 — 1 oo .. 

EE E 7. 

j=r t =0 i(ZZ< x 

j/r 2 <\ci\<(j+l)/r 2 


where we can afford to be loose about the order of summation because the contribution is 
0 for all but finitely many values of t. When 4 < |q| < the condition Sb ■ F < |a*| = 

Ci ■ i m ■ \ci\ < (S b + 1) • F implies that (^cpfi+T) 7 ' 2 ') — ' l — r2 ) ™ ! this observation 

along with (11.41) and the condition that // is symmetric allows us to make the following 
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estimates: 


OO 

E E 

t =0 iGl< x 

Sh-tf^ailKiSb+iyb* 

\ci\>l/r 


1 

- < 

i 


r 2 — l oo 

EE 


j=r t =0 



ieX< x 


Ci-O+ i) ’ 


<i< 




j/r 2 <|c i |<(. 7 +l)/r 2 


m 


1 

i 


r 2 — l oo 

= ^(1 + o(1)).2M[^,^])E 

j=r i=0 


_Sb±_ r 2 

cRT+iJ 


E 

-V c rJ 


1 

z 


The condition that i < x implies that for a given j, all terms with > x, or 

equivalently t > log fe d 0 no t contribute to the sum. If we take g(x) = logo;, we have 


E E 

t =0 i£l< x 

Sb-b^aiMSh+iyb* 

\ci\>l/r 


- < (1 + 0(1)) • C 2 ■ 2 i^r}) ■ log 6 {slllyl ■ 


Ci-jx” 


log ((1 + S b x )(l + j x )) 


m 


< (1 + o(l)) • C 2 ■ log 


C x ■ x m (log 6 (l + S b x ) + log b (l + r x )) 


S b + 1 


m 


= (! + o(l)) • (log b (l + S b 1 ) + log 6 (l + r x )) • (C 2 ■ g(x)), 


which is the desired bound. If on the other hand we take g(x) = log log x, it is easy to check 
that the proof given in [6] works mutatis mutandis in our case, the only significant difference 
being the additional factor of C 2 . □ 


We now proceed with the proof of 13.11 Applying Lemma 13.11 to bound the second sum 
on the right-hand-side of (13.2j) from above and below and combining the result with the our 
bound (j3.3[) on the first sum yields that 


< 


< 


(1 + o(l)) • (log b (l + S b 1 )- log 6 (l + r x )) • {C 2 • g(x)) 


E 

a,i£A(b,S b ) 


1 

i 


(1 + o(l)) • (log 6 (l + S b x ) + log 6 (l + r x ) + 2/r([0, r x ])) • (C 2 ■ g(x)) + K ■ g(r). 
















EVAN CHEN, PETER S. PARK, AND ASHVIN A. SWAMINATHAN 


Let e G (0, log b 2), and take r > (b £ — 1) 1 so that 2/x[(0, r x )] < e. Further taking x > 
max{r 1//£ ,exp((logr) 1 / £ )}, we find that 


< 


< 

< 


(1 + o(l)) • (log b (l + S b x ) 

E 1 

1 <Lx 

ai&A{b,Sb) 


e) ■ (C 2 ■ g(x)) 


(1 + o(l)) • ^log 6 (l + S b *) + 2e + ^ _ 9 g(x)) ' ^ 2 ' 
(1 + o(l)) ■ ^log b (l + S b x ) + 2e + —‘ (C *2 • g{x)), 


Taking e —> 0, we obtain the desired result. 


4. Frobenius Traces of Newforms 

We now apply the results of Theorems 11.11 and 11.21 to study the Frobenius traces of new¬ 
forms. As the case of newforms without complex multiplication is studied in [6], we shall 
consider the case of newforms with complex multiplication; however, we note that the theo¬ 
rems stated in this section hold in both cases. 

Given a newform / G 5^ ew (Fo(A^)) of even weight k > 2 and trivial nebentypus on To (IV) 
that has complex multiplication by an order in a (necessarily imaginary quadratic) number 
field K. let a,f(p) denote the trace of Frobenius of / at p for primes p. Recall that a,f(p) = 0 
for primes p if and only if p is inert or ramified in Ok- Thus, we restrict our attention 
to the traces of Frobenius a,f(p) at primes p that split in Ok (discarding the finitely many 
ramified primes). For convenience, let P k denote the set of primes that split in Ok, and 
for every x > 0, let P k< x = {p G Px : p < x}. By the Chebotarev Density Theorem, F K 
has arithmetic density, and hence logarithmic density, equal to - in the set of all primes. 
Therefore, we have that 

V- = (1 + o(l)) • loglogx V - = (l + o(l)) • - -log log a;, 

p p 2 

p<x p<3?x< x 

where the asymptotic on the left-hand-side of the above implication follows from the proof 
of Dirichlet’s Theorem for primes in arithmetic progressions. 

For each prime p G P#, let cos0 p = af(p)/(2p~2~) G [—1,1]; this is well-defined by the 
Hasse bound. Recall that a newform has complex multiplication by an imaginary quadratic 
held K if and only if it comes from a Grossencharakter of K (see Proposition 4.4 and 
Theorem 4.5 of ra>- So, by Hecke’s equidistribution result [5j for the angles given by 
Grossencharakters of imaginary quadratic fields over Q, we have the following: 

Theorem 4.1. Let f G S'^ ew (Fo(A^)) be a newform of even weight k > 2 and trivial neben¬ 
typus on To(A r ) that has complex multiplication by an order in a (necessarily imaginary 
quadratic) number field K. Then, for any subinterval [A, B] C [—1,1], we have that 

#{p < x : p G F k and cos 6 P G \A, B ]} 

hm —-- - ---— 

x^-oo #{p < x : p G P_^} 


AlAB]), 
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where pi is the complex-multiplication analogue of the Sato- Tate measure, defined by 


(4.1) 


dpi 


1 dt 
TT y'l-t 2 ' 


Note that the measure pi defined in (14.11) has the desired convexity property. From the 
above discussion, it follows that the sequence {a,f(p) : p prime} fulfills the hypotheses of 
Theorems 11.11 and 11.21 Thus, we obtain the following results as immediate corollaries: 


Theorem 4.2. Retain the setting of Theorem \f . 1\ The sequence {af{p)} P ev K n °t arith¬ 
metically Benford in any base b > 2. 


Theorem 4.3. Retain the setting of Theorem The sequence {af(p)} p ^p K is logarithmi¬ 
cally Benford in every base b > 2. 


In summary, we have shown that the Benford’s Law-type results on the Frobenius traces of 
newforms without complex multiplication proven in [6j also hold for newforms with complex 
multiplication. 
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